Abstract: Motivated by the demand for energy-efficient communication
solutions in the next generation cellular network, a mixed-ADC
receiver architecture for massive multiple input multiple
output (MIMO) systems is proposed, which differs from previous 
works in that herein one-bit analog-to-digital converters (ADCs) 
partially replace the conventionally assumed high-resolution ADCs. 
The information-theoretic tool of generalized mutual information 
(GMI) is exploited to analyze the achievable data rates of the 
proposed system architecture and an array of analytical results of 
engineering interest are obtained. For deterministic single input 
multiple output (SIMO) channels, a closed-form expression of 
the GMI is derived, based on which the linear combiner is 
optimized. Then, the asymptotic behaviors of the GMI in both 
low and high SNR regimes are explored, and the analytical results 
suggest a plausible ADC assignment scheme. Finally, the analytical 
framework is applied to the multi-user access scenario, and the 
corresponding numerical results demonstrate that the mixed system 
architecture with a relatively small number of high-resolution ADCs 
is able to achieve a large fraction of the channel capacity without 
output quantization. 

Index Terms: Analog-to-digital converter, generalized mutual
information, massive MIMO, mixed architecture, multi-user access.

I. Introduction 

The prosperity of mobile Internet calls for new technologies to
meet the exponential increase in demand for mobile data traffic.
In recent years, considerable attention has been focused on
massive multiple input multiple output (MIMO) systems, which
achieve significant gains in both energy efficiency and spectral
efficiency, and thus are envisioned as a promising key enabler
for the next generation cellular network [1], [2].

Thus far, most of the works on massive MIMO assume
perfect hardware implementation. However, this assumption is
not well justified, since the hardware cost and circuit power
consumption scale linearly with the number of base station (BS)
antennas and thus soon become economically unbearable unless
low-cost, energy-efficient hardware is deployed, which in turn is
prone to impairments. Among various sources of hardware
impairment, low-resolution analog-to-digital converters (ADCs)
have attracted widespread attention due to their favorable
properties of low cost, low power consumption and feasibility of
implementation [3]. For the Nyquist-sampled real Gaussian channel,
the authors of [4] established some general results regarding
low-resolution output quantization. The authors of [5] designed
a modified minimum mean square error (MMSE) receiver for
MIMO systems with output quantization. In [6], the authors
investigated a practical monobit digital receiver paradigm for
impulse radio ultra-wideband (UWB) systems. Recently, the
authors of [7] examined the impact of one-bit quantization on
achievable rates of massive MIMO systems with both perfect
and estimated channel state information (CSI). The authors of
[8] addressed the high signal-to-noise ratio (SNR) capacities of both
single input multiple output (SIMO) and MIMO channels with
one-bit quantization.

Despite its great superiority in deployment cost and energy
efficiency, one-bit quantization generally has to tolerate a large rate
loss, especially in the high SNR regime [8], which highlights
the indispensability of high-resolution ADCs for digital receivers.
Motivated by this consideration, in this paper we propose a
mixed receiver architecture for massive MIMO systems in which
one-bit ADCs partially, but not completely, replace the
conventionally assumed high-resolution ADCs. This architecture has the
potential to remarkably reduce the hardware cost
and power consumption while still maintaining a large fraction of
the performance gains promised by massive MIMO.

Recognizing the challenge in working with the channel capacity
directly, we take an alternative route and seek to characterize
the achievable data rates specific to a given encoding/decoding
scheme. To this end, we exploit the information-theoretic tool
of generalized mutual information (GMI) [9] to address the
achievable data rates of our proposed system architecture. As a
performance measure for mismatched decoding, GMI has proved
convenient and useful in several important scenarios such as
fading channels with imperfect CSI at the receiver [9] and
channels with transceiver distortion [10], [11].

Exploiting a general analytical framework developed in [10], 
we obtain a series of analytical results. First, we consider a 
deterministic SIMO channel where the BS is equipped with N 
antennas but only has access to K pairs of high-resolution ADCs 
and (N - K) pairs of one-bit ADCs, and derive a closed-form
expression of the GMI for general ADC assignment and linear 
combiner. This enables us to optimize the linear combiner design 
and further explore the asymptotic behaviors of the GMI in both 
low and high SNR regimes that in turn help us suggest a plausible 
ADC assignment scheme. The corresponding numerical results 
indicate that even with a small number of high-resolution ADCs, 
our system architecture already achieves a substantial fraction of 
the channel capacity without output quantization, thus verifying 
the effectiveness of the mixed architecture. Further issues such 
as ergodic fading channels with or without channel estimation 
error are included in an extended work [12]. 

Then, we apply our analysis to the multi-user access scenario. 
Numerical results reveal that the mixed system architecture with 
a small number of high-resolution ADCs achieves a large fraction 
of the channel capacity without output quantization, provided 
that the multi-user system is properly loaded. In summary, the
proposed mixed architecture strikes a reasonable and attractive
balance between cost and spectral efficiency, for both single-user 
and multi-user scenarios. Thus we envision it as a promising 
receiver paradigm for energy-efficient massive MIMO systems. 

The remaining part of this paper is organized as follows. 
Section II outlines the system model. Adopting GMI as the 
performance metric, Section III establishes the theoretical framework
for deterministic SIMO channels, based on which the
optimal linear combiner design and the asymptotic behaviors 
of the GMI in both low and high SNR regimes are explored. 
Section IV extends the theoretical framework to the multi-user 
access scenario. Numerical results are given in Section V to 
corroborate the analysis. Finally, Section VI concludes the paper. 

Notation: Throughout this paper, we use $\|\mathbf{x}\|$ to represent
the 2-norm of vector $\mathbf{x}$, and let $\mathbf{X}^*$, $\mathbf{X}^T$ and $\mathbf{X}^H$ denote the
conjugate, transpose and conjugate transpose of $\mathbf{X}$, respectively.
The complex Gaussian distribution with mean $\mu$ and variance $\sigma^2$ is
denoted by $\mathcal{CN}(\mu, \sigma^2)$, while $\mathcal{CN}(\boldsymbol{\mu}, \mathbf{C})$ stands for the
distribution of a circularly symmetric complex Gaussian random vector
with mean $\boldsymbol{\mu}$ and covariance matrix $\mathbf{C}$. We use $\log(x)$ to denote
the natural logarithm of positive real number $x$.

II. System Model

We start by focusing on a single-user system where a single-antenna
user communicates with an $N$-antenna BS. In this
paper, we consider a narrow-band channel model, for which the
frequency-flat fading channel $\mathbf{h}$ is chosen according to $\mathcal{CN}(\mathbf{0}, \mathbf{I})$
and is fixed throughout the transmission of the codeword. Moreover,
the realization of the channel is assumed to be perfectly
known by the BS and thus is deemed as deterministic in the
subsequent analysis. The received signal at the BS can be
expressed as

$$ \mathbf{y}^l = \mathbf{h} x^l + \mathbf{z}^l, \quad \text{for } l = 1, 2, \ldots, L, \tag{1} $$

where $x^l$ is the complex signal transmitted at the $l$-th symbol
time, $\mathbf{z}^l \sim \mathcal{CN}(\mathbf{0}, \sigma^2 \mathbf{I})$ models the independent and identically
distributed (i.i.d.) complex Gaussian noise vector, and $L$ is the
codeword length.

We consider a mixed architecture, as illustrated in Figure 1,
in which only $2K$ high-resolution ADCs are available and all
the other $2(N-K)$ ADCs have only one-bit resolution.
We further let the I/Q outputs at each antenna be quantized by
two ADCs of the same kind. Thus the quantized output can be
expressed as

$$ r_n^l = \delta_n \cdot (h_n x^l + z_n^l) + \bar{\delta}_n \cdot \mathrm{sgn}(h_n x^l + z_n^l), \tag{2} $$

for $l = 1, \ldots, L$ and $n = 1, \ldots, N$. Here $\bar{\delta}_n = 1 - \delta_n$, and
$\delta_n \in \{0, 1\}$ is the binding indicator: $\delta_n = 1$ means that the
ADCs corresponding to the $n$-th antenna are high-resolution,
whereas $\delta_n = 0$ indicates that they have one-bit resolution.
Note that here we assume sufficiently high resolution, so that the
residual quantization noise is negligible, for $\delta_n = 1$. For notational
simplicity, we further define a binding vector $\boldsymbol{\delta} = [\delta_1, \ldots, \delta_N]^T$,
which should be optimized according to the channel state $\mathbf{h}$ so
that the limited number of high-resolution ADCs will be well
utilized to enhance the system performance.
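
As an illustration of the quantization model in (2), the following minimal Python sketch applies the mixed quantization to one received sample per antenna. It assumes that sgn(·) acts separately on the real and imaginary parts with unit output levels, which matches the I/Q description above but is an assumption of this sketch; the function names `sgn_complex` and `mixed_adc_output` are hypothetical.

```python
def sgn_complex(v: complex) -> complex:
    """One-bit I/Q quantizer (assumed form): keep only the signs of Re and Im."""
    re = 1.0 if v.real >= 0 else -1.0
    im = 1.0 if v.imag >= 0 else -1.0
    return complex(re, im)


def mixed_adc_output(h, x, z, delta):
    """Quantized outputs r_n for one symbol time.

    h, z  : lists of complex channel coefficients and noise samples
    x     : complex transmitted symbol
    delta : list of 0/1 binding indicators (1 = high-resolution pair)
    """
    r = []
    for h_n, z_n, d_n in zip(h, z, delta):
        y_n = h_n * x + z_n          # unquantized antenna output
        r.append(y_n if d_n == 1 else sgn_complex(y_n))
    return r


# Two antennas: the first keeps its sample, the second keeps only the signs.
r = mixed_adc_output(h=[1 + 1j, 2 - 1j], x=1 + 0j, z=[0j, 0j], delta=[1, 0])
```

In this toy call the noise is set to zero so that the effect of the binding indicator is visible directly in `r`.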

For transmission of rate $R$, the user selects a message $m$
from $\mathcal{M} = \{1, 2, \ldots, \lfloor 2^{LR} \rfloor\}$ uniformly at random, and maps
the selected message to a transmitted codeword, i.e., a length-$L$
complex sequence, $\{x^l(m)\}_{l=1}^{L}$. In this paper, we restrict
the codebook to be drawn from a Gaussian ensemble; that is,
each codeword is a sequence of $L$ i.i.d. $\mathcal{CN}(0, \mathcal{E}_s)$ random
variables, and all the codewords are mutually independent. Such
a choice of codebook satisfies the average power constraint
$\frac{1}{L} \sum_{l=1}^{L} \mathbb{E}[|x^l(m)|^2] \le \mathcal{E}_s$. We thus define the SNR as
$\mathrm{SNR}_d = \mathcal{E}_s/\sigma^2$, and let $\sigma^2 = 1$ hereafter for convenience.

Fig. 1. Illustration of the system architecture.

For an $N$-antenna SIMO channel, we let $C(N, K_1, K_2)$ denote
its capacity when equipped with $K_1$ pairs of high-resolution
ADCs and $K_2$ pairs of one-bit ADCs, where $0 \le K_1, K_2$ and
$K_1 + K_2 \le N$. As discussed in the introduction, the evaluation
of $C(N, K, N-K)$ is a formidable task. Therefore in the
following, we adopt the nearest-neighbor decoding rule at the
decoder, and leverage the general framework developed in [10]
to investigate the GMI of our proposed system architecture. The
GMI acts as an achievable rate and thus also a lower bound of
$C(N, K, N-K)$. To this end, we introduce a linear combiner
to process the channel output vector, as illustrated in Figure 1.
Thus the processed channel output is

$$ \hat{x}^l = \mathbf{w}^H \mathbf{r}^l, \tag{3} $$

for $l = 1, \ldots, L$, where $\mathbf{w}$ is designed according to the channel
state $\mathbf{h}$ and the binding vector $\boldsymbol{\delta}$.

With nearest-neighbor decoding, upon observing $\{\hat{x}^l\}_{l=1}^{L}$, the
decoder computes, for all possible messages, the distance metrics

$$ D(m) = \frac{1}{L} \sum_{l=1}^{L} \left| \hat{x}^l - a x^l(m) \right|^2, \quad m \in \mathcal{M}, \tag{4} $$

and decides the received message as the one that minimizes (4).
The scaling parameter $a$ is selected appropriately for optimizing
the decoding performance.
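
The decoding rule in (4) can be sketched in a few lines of Python. This is a toy illustration with a hypothetical two-codeword codebook and 0-based message indices, not the random Gaussian ensemble analyzed in this paper; the function names are likewise illustrative.

```python
def distance_metric(x_hat, codeword, a):
    """Average squared distance between combiner outputs and a scaled codeword."""
    L = len(x_hat)
    return sum(abs(xh - a * xc) ** 2 for xh, xc in zip(x_hat, codeword)) / L


def nearest_neighbor_decode(x_hat, codebook, a):
    """Return the index of the codeword minimizing the distance metric."""
    return min(range(len(codebook)),
               key=lambda m: distance_metric(x_hat, codebook[m], a))


# Toy example (hypothetical values): two codewords of length L = 2.
codebook = [[1 + 0j, -1 + 0j], [-1 + 0j, 1 + 0j]]
x_hat = [0.4 + 0.1j, -0.5 + 0j]
decoded = nearest_neighbor_decode(x_hat, codebook, a=0.5)
```

The scaling parameter `a` is passed in explicitly, since its best value depends on the combiner and the ADC assignment.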

III. GMI and Optimization of Combiner 
A. GMI of the Proposed System Architecture 

From now on, we suppress the time index $l$ for notational
simplicity. To facilitate the exposition, we summarize (2) and
(3) as

$$ \hat{x} = \mathbf{w}^H \mathbf{r} = f(x, \mathbf{h}, \mathbf{z}), \tag{5} $$

where $f(\cdot)$ is a memoryless nonlinear distortion function that
incorporates the effects of output quantization as well as linear
combining. Although $\boldsymbol{\delta}$ and $\mathbf{w}$ do not appear as arguments of
$f(\cdot)$ since they are both determined by $\mathbf{h}$, we need to keep in
mind that $f(\cdot)$ implicitly includes $\boldsymbol{\delta}$ and $\mathbf{w}$.

We apply the general framework developed in [10] to derive
the GMI of our proposed system architecture. In particular, following
a procedure similar to that of [10, Appendix C], we obtain the
conditional GMI given as follows.

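Proposition 1: With Gaussian codebook ensemble and
nearest-neighbor decoding, the GMI conditioned on $\mathbf{w}$ and $\boldsymbol{\delta}$ is
given as

$$ I_{\mathrm{GMI}}(\mathbf{w}, \boldsymbol{\delta}) = \log\left(1 + \frac{\kappa(\mathbf{w}, \boldsymbol{\delta})}{1 - \kappa(\mathbf{w}, \boldsymbol{\delta})}\right), \tag{6} $$

where the parameter $\kappa(\mathbf{w}, \boldsymbol{\delta})$ is

$$ \kappa(\mathbf{w}, \boldsymbol{\delta}) = \frac{\left|\mathbb{E}[f^*(x, \mathbf{h}, \mathbf{z}) \cdot x]\right|^2}{\mathcal{E}_s \, \mathbb{E}[|f(x, \mathbf{h}, \mathbf{z})|^2]}. \tag{7} $$

The corresponding optimal choice of the scaling parameter $a$ is

$$ a_{\mathrm{opt}}(\mathbf{w}, \boldsymbol{\delta}) = \frac{\mathbb{E}[f(x, \mathbf{h}, \mathbf{z}) \cdot x^*]}{\mathcal{E}_s}. \tag{8} $$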
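
For intuition, the parameter κ of Proposition 1 can be estimated from paired samples of the channel input and the processed output, and then mapped to a rate. The sketch below does this in plain Python; the helper names and the tiny hand-picked sample set are illustrative assumptions, and empirical averages stand in for the expectations.

```python
import math


def kappa_from_samples(x, f, Es):
    """Empirical squared correlation: |E[f* x]|^2 / (Es * E[|f|^2])."""
    n = len(x)
    cross = sum(fi.conjugate() * xi for xi, fi in zip(x, f)) / n
    power = sum(abs(fi) ** 2 for fi in f) / n
    return abs(cross) ** 2 / (Es * power)


def gmi_from_kappa(kappa):
    """Rate in nats per channel use, log(1 + kappa / (1 - kappa)); needs kappa < 1."""
    return math.log(1.0 + kappa / (1.0 - kappa))


# Hand-picked samples: unit-power input plus a disturbance orthogonal to it.
x = [1 + 0j, -1 + 0j, 1 + 0j, -1 + 0j]
d = [1 + 0j, 1 + 0j, -1 + 0j, -1 + 0j]
f = [xi + di for xi, di in zip(x, d)]
kappa = kappa_from_samples(x, f, Es=1.0)   # signal and disturbance have equal power
rate = gmi_from_kappa(kappa)
```

Here the disturbance carries as much power as the signal, so the effective SNR κ/(1 − κ) equals one.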


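Note that $\kappa(\mathbf{w}, \boldsymbol{\delta})$ is the squared correlation coefficient of
the channel input $x$ and the processed output $f(x, \mathbf{h}, \mathbf{z})$, and thus
is upper bounded by one, from the Cauchy-Schwarz inequality.
Moreover, $I_{\mathrm{GMI}}(\mathbf{w}, \boldsymbol{\delta})$ is a strictly increasing function of $\kappa(\mathbf{w}, \boldsymbol{\delta})$
for $\kappa(\mathbf{w}, \boldsymbol{\delta}) \in (0, 1)$. Therefore in the following, we will seek to
maximize $\kappa(\mathbf{w}, \boldsymbol{\delta})$ by choosing a well-designed linear combiner
$\mathbf{w}$ and binding vector $\boldsymbol{\delta}$. To start with, we derive a closed-form
expression of $\kappa(\mathbf{w}, \boldsymbol{\delta})$, given as follows.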

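Proposition 2: Given $\mathbf{w}$ and $\boldsymbol{\delta}$, for (7) in Proposition 1, we
have

$$ \kappa(\mathbf{w}, \boldsymbol{\delta}) = \frac{\mathbf{w}^H \mathbf{R}_{rx} \mathbf{R}_{rx}^H \mathbf{w}}{\mathcal{E}_s \mathbf{w}^H \mathbf{R}_{rr} \mathbf{w}}, \tag{9} $$

where $\mathbf{R}_{rx} = \mathbb{E}[\mathbf{r} x^*]$ is the correlation vector between $\mathbf{r}$ and $x$,
with its $n$-th element being

$$ (\mathbf{R}_{rx})_n = h_n \mathcal{E}_s \left[ \delta_n + \bar{\delta}_n \cdot \sqrt{\frac{4}{\pi(|h_n|^2 \mathcal{E}_s + 1)}} \right], \tag{10} $$

and $\mathbf{R}_{rr} = \mathbb{E}[\mathbf{r}\mathbf{r}^H]$ is the covariance matrix of $\mathbf{r}$, with its $(n, m)$-th
entry being

$$ (\mathbf{R}_{rr})_{n,m} = \begin{cases}
1 + \delta_n \cdot |h_n|^2 \mathcal{E}_s + \bar{\delta}_n, & \text{if } n = m, \\[4pt]
h_n h_m^* \mathcal{E}_s \left[ \delta_n \delta_m + \delta_n \bar{\delta}_m \cdot \sqrt{\frac{4}{\pi(|h_m|^2 \mathcal{E}_s + 1)}} + \bar{\delta}_n \delta_m \cdot \sqrt{\frac{4}{\pi(|h_n|^2 \mathcal{E}_s + 1)}} \right] \\
\quad + \bar{\delta}_n \bar{\delta}_m \cdot \frac{4}{\pi} \left[ \arcsin\left( \frac{(h_n h_m^*)_R \mathcal{E}_s}{\sqrt{|h_n|^2 \mathcal{E}_s + 1} \sqrt{|h_m|^2 \mathcal{E}_s + 1}} \right) + i \cdot \arcsin\left( \frac{(h_n h_m^*)_I \mathcal{E}_s}{\sqrt{|h_n|^2 \mathcal{E}_s + 1} \sqrt{|h_m|^2 \mathcal{E}_s + 1}} \right) \right], & \text{if } n \ne m.
\end{cases} \tag{11} $$

The corresponding optimal choice of the scaling parameter $a$ in
(8) is

$$ a_{\mathrm{opt}}(\mathbf{w}, \boldsymbol{\delta}) = \frac{1}{\mathcal{E}_s} \mathbf{w}^H \mathbf{R}_{rx}. \tag{12} $$

Proof: See [12, Appendix A].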
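
The closed-form statistics in Proposition 2 translate directly into code. The following plain-Python sketch builds R_rx and R_rr and evaluates κ for a given combiner. It assumes unit noise variance, a per-branch gain of sqrt(4/(π(|h_n|²E_s + 1))) for one-bit antennas, and a 4/π-scaled arcsine term between two one-bit antennas, which is how (10) and (11) are read here; the function names are hypothetical.

```python
import math


def correlation_stats(h, delta, Es):
    """Return (R_rx, R_rr) for channel h, binding vector delta and power Es."""
    N = len(h)
    # Standard deviation of the unquantized output h_n x + z_n (unit noise).
    sig = [math.sqrt(abs(hn) ** 2 * Es + 1.0) for hn in h]
    # Linear gain seen by x on each branch: 1 for a high-resolution pair,
    # sqrt(4 / (pi * sig^2)) for a one-bit pair.
    g = [1.0 if d == 1 else math.sqrt(4.0 / math.pi) / s
         for d, s in zip(delta, sig)]
    R_rx = [h[n] * Es * g[n] for n in range(N)]
    R_rr = [[0j] * N for _ in range(N)]
    for n in range(N):
        for m in range(N):
            c = h[n] * h[m].conjugate() * Es      # signal part of E[y_n y_m^*]
            if n == m:
                R_rr[n][m] = complex(1.0 + delta[n] * abs(h[n]) ** 2 * Es
                                     + (1 - delta[n]))
            elif delta[n] == 0 and delta[m] == 0:
                rho = c / (sig[n] * sig[m])       # normalized correlation
                R_rr[n][m] = (4.0 / math.pi) * complex(math.asin(rho.real),
                                                       math.asin(rho.imag))
            else:
                R_rr[n][m] = c * g[n] * g[m]
    return R_rx, R_rr


def kappa(w, R_rx, R_rr, Es):
    """Evaluate |w^H R_rx|^2 / (Es * w^H R_rr w) for a given combiner w."""
    N = len(w)
    num = abs(sum(w[n].conjugate() * R_rx[n] for n in range(N))) ** 2
    den = sum(w[n].conjugate() * R_rr[n][m] * w[m]
              for n in range(N) for m in range(N)).real
    return num / (Es * den)


# Single one-bit antenna with |h|^2 * Es = 1.
R_rx, R_rr = correlation_stats([1 + 0j], [0], Es=1.0)
k_one_bit = kappa([1 + 0j], R_rx, R_rr, Es=1.0)
```

For a single high-resolution antenna the same functions return κ = SNR/(1 + SNR), so the GMI coincides with log(1 + SNR), which is a quick sanity check of the sketch.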


B. Optimization of Linear Combiner 

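In this subsection, we optimize $\mathbf{w}$ such that the GMI
is maximized for given $\mathbf{h}$ and $\boldsymbol{\delta}$. The subsequent proposition
summarizes our result.

Proposition 3: For given $\mathbf{h}$ and $\boldsymbol{\delta}$, the optimal linear combiner
$\mathbf{w}$ takes the following form

$$ \mathbf{w}_{\mathrm{opt}} = \mathbf{R}_{rr}^{-1} \mathbf{R}_{rx}, \tag{13} $$

which is in fact a minimum mean square error (MMSE) combiner
that minimizes the mean squared estimation error of $x$ upon
observing $\mathbf{r}$. The corresponding $\kappa(\mathbf{w}, \boldsymbol{\delta})$ is

$$ \kappa(\mathbf{w}_{\mathrm{opt}}, \boldsymbol{\delta}) = a_{\mathrm{opt}}(\mathbf{w}_{\mathrm{opt}}, \boldsymbol{\delta}) = \frac{1}{\mathcal{E}_s} \mathbf{R}_{rx}^H \mathbf{R}_{rr}^{-1} \mathbf{R}_{rx}. \tag{14} $$

Proof: See [12, Section III-B].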
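
Computing the MMSE combiner only requires solving one linear system. As a hedged illustration in plain Python, the sketch below computes w_opt = R_rr^{-1} R_rx and the resulting κ from given R_rr and R_rx. The helper `solve` is a small hypothetical Gaussian-elimination routine included to keep the example free of external libraries; a numerical library routine would normally be used instead. The example values correspond to two high-resolution antennas with h = [1, 1]^T and E_s = 1, for which R_rx = E_s h and R_rr = I + E_s h h^H.

```python
import math


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [[complex(v) for v in A[i]] + [complex(b[i])] for i in range(n)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0j] * n
    for i in reversed(range(n)):
        s = M[i][n] - sum(M[i][c] * x[c] for c in range(i + 1, n))
        x[i] = s / M[i][i]
    return x


def mmse_combiner(R_rr, R_rx, Es):
    """Return (w_opt, kappa_opt) with kappa_opt = R_rx^H R_rr^{-1} R_rx / Es."""
    w = solve(R_rr, R_rx)
    kappa_opt = sum(r.conjugate() * wi for r, wi in zip(R_rx, w)).real / Es
    return w, kappa_opt


# Two high-resolution antennas, h = [1, 1], Es = 1 (illustrative values).
R_rx = [1 + 0j, 1 + 0j]
R_rr = [[2 + 0j, 1 + 0j], [1 + 0j, 2 + 0j]]
w_opt, kappa_opt = mmse_combiner(R_rr, R_rx, Es=1.0)
gmi = math.log(1.0 + kappa_opt / (1.0 - kappa_opt))
```

In this all-high-resolution example the GMI equals log(1 + ||h||² E_s) = log 3, i.e., the MMSE combiner recovers the unquantized capacity, as expected.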


C. Asymptotic Behaviors of $I_{\mathrm{GMI}}(\mathbf{w}_{\mathrm{opt}}, \boldsymbol{\delta})$

In the previous subsection, the optimal linear combiner for our
proposed framework was derived. We are thus ready to examine
its asymptotic performance in both low and high SNR regimes.
Letting the SNR tend to zero, we have the following corollary.

Corollary 1: As $\mathcal{E}_s \to 0$, for given $\boldsymbol{\delta}$ we have

$$ I_{\mathrm{GMI}}(\mathbf{w}_{\mathrm{opt}}, \boldsymbol{\delta}) = \sum_{n=1}^{N} \left( \delta_n + \bar{\delta}_n \cdot \frac{2}{\pi} \right) |h_n|^2 \mathcal{E}_s + o(\mathcal{E}_s). \tag{15} $$

See [12, Appendix B] for its proof. Comparing with
$C(N, N, 0)$ in the low SNR regime, i.e., $C(N, N, 0) =
\sum_{n=1}^{N} |h_n|^2 \mathcal{E}_s + o(\mathcal{E}_s)$, we conclude that the part of the achievable
rate contributed by the one-bit antennas is degraded by a factor of $2/\pi$ due to one-bit quantization.
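
To make the low-SNR comparison concrete, the sketch below evaluates the first-order coefficients of the GMI and of C(N, N, 0) for a small hypothetical channel. It assumes the per-antenna weights as read from Corollary 1, namely 1 for high-resolution antennas and 2/π for one-bit antennas; the function names and channel values are illustrative.

```python
import math


def low_snr_gmi_slope(h, delta):
    """First-order coefficient of the GMI in Es as Es -> 0."""
    return sum((d + (1 - d) * 2.0 / math.pi) * abs(hn) ** 2
               for hn, d in zip(h, delta))


def low_snr_capacity_slope(h):
    """First-order coefficient of C(N, N, 0) in Es as Es -> 0."""
    return sum(abs(hn) ** 2 for hn in h)


# Hypothetical 3-antenna channel; only the first antenna is high-resolution.
h = [2 + 0j, 1j, 1 + 0j]
delta = [1, 0, 0]
fraction = low_snr_gmi_slope(h, delta) / low_snr_capacity_slope(h)
```

Because the strongest antenna keeps its full contribution, the retained fraction in this example is well above the all-one-bit value of 2/π.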

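For the high SNR case, the subsequent corollary collects our
results.

Corollary 2: As $\mathcal{E}_s \to \infty$, for given $\boldsymbol{\delta}$ we have the effective
SNR in (6) as

$$ \frac{\kappa(\mathbf{w}_{\mathrm{opt}}, \boldsymbol{\delta})}{1 - \kappa(\mathbf{w}_{\mathrm{opt}}, \boldsymbol{\delta})} = \|\mathbf{p}\|^2 \mathcal{E}_s + \frac{[4 + O(1/\mathcal{E}_s)] \, \mathbf{q}^H \mathbf{B}^{-1} \mathbf{q}}{\pi - [4 + O(1/\mathcal{E}_s)] \, \mathbf{q}^H \mathbf{B}^{-1} \mathbf{q}}, \tag{16} $$

with $\mathbf{p}$, $\mathbf{q}$, and $\mathbf{B}$ given by

$$ \mathbf{p} \triangleq [\tilde{h}_1, \ldots, \tilde{h}_K]^T, \tag{17} $$

$$ \mathbf{q} \triangleq [\tilde{h}_{K+1}/|\tilde{h}_{K+1}|, \ldots, \tilde{h}_N/|\tilde{h}_N|]^T, \tag{18} $$

$$ (\mathbf{B})_{n,m} \triangleq \frac{4}{\pi} \left[ \arcsin\left( \frac{(\tilde{h}_{n+K} \tilde{h}_{m+K}^*)_R}{|\tilde{h}_{n+K} \tilde{h}_{m+K}|} \right) + i \cdot \arcsin\left( \frac{(\tilde{h}_{n+K} \tilde{h}_{m+K}^*)_I}{|\tilde{h}_{n+K} \tilde{h}_{m+K}|} \right) \right] + O(1/\mathcal{E}_s), \tag{19} $$

where $\tilde{\mathbf{h}}$ is a rearrangement of $\mathbf{h}$, obtained by stacking the channel
coefficients of the antennas equipped with high-resolution ADCs
in the first $K$ positions of $\tilde{\mathbf{h}}$.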

See [12, Appendix C] for the proof. An observation from
(16) indicates that the contributions of high-resolution ADCs and
one-bit ADCs in the high SNR regime are separate, as the first
term corresponding to high-resolution ADCs increases linearly
with $\mathcal{E}_s$, whereas the second term coming from one-bit ADCs
tends to a positive constant independent of $\mathcal{E}_s$. Comparing
with Corollary 1, we infer that one-bit ADCs become less
beneficial as the SNR grows large, as will be validated by the
numerical study in Section V. Moreover, both (15) and (16)
suggest that high-resolution ADCs should be assigned to the
antennas with the $K$ strongest link magnitude gains, at least in
the low and high SNR regimes.
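
Both observations can be illustrated with a short Python sketch. The first function specializes the analysis to a single antenna with a one-bit ADC pair (N = 1, K = 0), for which the statistics of Proposition 2, as read here, give κ = (2/π)|h|²E_s/(|h|²E_s + 1); this specialization is worked out for illustration and its effective SNR saturates at 2/(π − 2). The second function implements the suggested assignment rule as a sort, with ties broken by antenna index. Function names and channel values are hypothetical.

```python
import math


def one_bit_effective_snr(h, Es):
    """kappa / (1 - kappa) for a single antenna with a one-bit ADC pair."""
    snr = abs(h) ** 2 * Es
    kappa = (2.0 / math.pi) * snr / (snr + 1.0)
    return kappa / (1.0 - kappa)


def assign_strongest(h, K):
    """Binding vector with delta_n = 1 for the K largest |h_n|, else 0."""
    order = sorted(range(len(h)), key=lambda n: abs(h[n]), reverse=True)
    chosen = set(order[:K])
    return [1 if n in chosen else 0 for n in range(len(h))]


limit = 2.0 / (math.pi - 2.0)            # high-SNR ceiling of the effective SNR
values = [one_bit_effective_snr(1 + 0j, Es) for Es in (1.0, 10.0, 1e3, 1e6)]
gmi_ceiling = math.log(1.0 + limit)      # nats per channel use

# Hypothetical channel: antennas 1 and 2 have the largest magnitudes.
delta = assign_strongest([0.5 + 0j, 2j, -1 + 0j, 0.1 + 0j], K=2)
```

The ceiling of roughly one nat per channel use for a lone one-bit antenna is what makes the linearly growing high-resolution term dominate at high SNR.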


IV. Extension to Multi-User Scenario

In this section, the BS serves $M$ single-antenna users simultaneously.
For simplicity, we focus on the deterministic channel
case, and the channel matrix between the users and the BS is
denoted by $\mathbf{H} = [\mathbf{h}_1, \ldots, \mathbf{h}_N] \in \mathbb{C}^{M \times N}$, whose elements are
i.i.d. $\mathcal{CN}(0, 1)$, with $\mathbf{h}_n = [h_{1n}, \ldots, h_{Mn}]^T$ collecting the channel
coefficients related to the $n$-th BS antenna. There are still only
$K$ pairs of high-resolution ADCs available at the BS. Thus we
rewrite the quantized output at the $n$-th antenna, with user $j$
considered, as

$$ r_n^{\mathrm{mu}} = \delta_n \cdot \left( \sum_{i=1}^{M} h_{in} x_i + z_n \right) + \bar{\delta}_n \cdot \mathrm{sgn}\left( \sum_{i=1}^{M} h_{in} x_i + z_n \right), \tag{20} $$

where $x_i \sim \mathcal{CN}(0, \mathcal{E}_s)$ denotes the i.i.d. coded signal dedicated
to the $i$-th user, and $\sum_{i \ne j} h_{in} x_i + z_n$ summarizes the co-channel
interference and noise for the considered user $j$. For
a fair comparison, the SNR in this situation is defined as
$\mathrm{SNR}_d = M \mathcal{E}_s$, reflecting the total transmit power from all users.

Following a derivation procedure similar to that
in Section III, we get the GMI of the considered user.

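Proposition 4: For given $\mathbf{H}$ and $\boldsymbol{\delta}$, when treating other users'
signals as noise, the GMI of user $j$ is

$$ I_{\mathrm{GMI}}^{\mathrm{mu}} = \log\left(1 + \frac{\kappa^{\mathrm{mu}}}{1 - \kappa^{\mathrm{mu}}}\right), \tag{21} $$

where the parameter $\kappa^{\mathrm{mu}}$ is

$$ \kappa^{\mathrm{mu}} = \frac{1}{\mathcal{E}_s} (\mathbf{R}_{rx}^{\mathrm{mu}})^H (\mathbf{R}_{rr}^{\mathrm{mu}})^{-1} \mathbf{R}_{rx}^{\mathrm{mu}}. \tag{22} $$

$\mathbf{R}_{rx}^{\mathrm{mu}}$ is the correlation vector between $\mathbf{r}^{\mathrm{mu}}$ and $x_j$, with its $n$-th
entry given as

$$ (\mathbf{R}_{rx}^{\mathrm{mu}})_n = h_{jn} \mathcal{E}_s \left[ \delta_n + \bar{\delta}_n \cdot \sqrt{\frac{4}{\pi(\|\mathbf{h}_n\|^2 \mathcal{E}_s + 1)}} \right], \tag{23} $$

and $\mathbf{R}_{rr}^{\mathrm{mu}}$ is the covariance matrix of $\mathbf{r}^{\mathrm{mu}}$, with its $(n, m)$-th
entry being

$$ (\mathbf{R}_{rr}^{\mathrm{mu}})_{n,m} = \begin{cases}
1 + \delta_n \cdot \|\mathbf{h}_n\|^2 \mathcal{E}_s + \bar{\delta}_n, & \text{if } n = m, \\[4pt]
\mathbf{h}_n^T \mathbf{h}_m^* \mathcal{E}_s \left[ \delta_n \delta_m + \delta_n \bar{\delta}_m \cdot \sqrt{\frac{4}{\pi(\|\mathbf{h}_m\|^2 \mathcal{E}_s + 1)}} + \bar{\delta}_n \delta_m \cdot \sqrt{\frac{4}{\pi(\|\mathbf{h}_n\|^2 \mathcal{E}_s + 1)}} \right] \\
\quad + \bar{\delta}_n \bar{\delta}_m \cdot \frac{4}{\pi} \left[ \arcsin\left( \frac{(\mathbf{h}_n^T \mathbf{h}_m^*)_R \mathcal{E}_s}{\sqrt{\|\mathbf{h}_n\|^2 \mathcal{E}_s + 1} \sqrt{\|\mathbf{h}_m\|^2 \mathcal{E}_s + 1}} \right) + i \cdot \arcsin\left( \frac{(\mathbf{h}_n^T \mathbf{h}_m^*)_I \mathcal{E}_s}{\sqrt{\|\mathbf{h}_n\|^2 \mathcal{E}_s + 1} \sqrt{\|\mathbf{h}_m\|^2 \mathcal{E}_s + 1}} \right) \right], & \text{if } n \ne m.
\end{cases} \tag{24} $$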

In the multi-user scenario, there is no clear guideline on how to
assign the high-resolution ADCs. For this reason, we consider
two heuristic ADC assignment schemes in the numerical study:


• Scheme #1: high-resolution ADCs are assigned to the $K$

antennas with the largest $|h_{jn}|^2$.

• Scheme #2: high-resolution ADCs are assigned randomly.

Scheme #1 follows from the asymptotic property of the GMI in the
low SNR regime, and the proof is omitted due to space limitation.
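
Both heuristics are simple to state in code. The sketch below assumes the channel matrix is stored as a list of M rows (one per user) with N entries each, so that `H[j][n]` plays the role of h_{jn}; the function names, the 0-based user index, and the use of a seeded `random.Random` instance are illustrative choices.

```python
import random


def scheme_1(H, j, K):
    """High-resolution ADCs on the K antennas with the largest |h_jn|^2."""
    N = len(H[j])
    order = sorted(range(N), key=lambda n: abs(H[j][n]) ** 2, reverse=True)
    chosen = set(order[:K])
    return [1 if n in chosen else 0 for n in range(N)]


def scheme_2(N, K, rng):
    """High-resolution ADCs on K antennas drawn uniformly at random."""
    chosen = set(rng.sample(range(N), K))
    return [1 if n in chosen else 0 for n in range(N)]


# Hypothetical 2-user, 4-antenna channel.
H = [[0.2 + 0j, 1 + 1j, -0.5j, 2 + 0j],
     [1 + 0j, 0.1j, 3 + 0j, 0.3 + 0j]]
delta_user0 = scheme_1(H, j=0, K=2)
delta_random = scheme_2(N=4, K=2, rng=random.Random(0))
```

Note that Scheme #1 depends on which user is considered, so different users generally favor different assignments.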


V. Numerical Results 

In this section, we validate our previous analysis with numeri¬ 
cal results. The GMI and the channel capacity are both averaged 
over a large number of channel realizations, for both single-user 
and multi-user scenarios. 



Fig. 2. GMI of the proposed system architecture for different numbers of
high-resolution ADCs, N = 100.



Fig. 3. GMI of the proposed system architecture for different numbers of
high-resolution ADCs, N = 400.


A. GMI for Single-User Scenario 

We assign the high-resolution ADCs to the antennas with the
$K$ strongest link magnitude gains, as suggested by Corollary
1 and Corollary 2. In Figure 2 and Figure 3, the solid curves
represent $I_{\mathrm{GMI}}(\mathbf{w}_{\mathrm{opt}}, \boldsymbol{\delta})$, the dashed lines correspond to the
channel capacity without output quantization, i.e., $C(N, N, 0) =
\log(1 + \|\mathbf{h}\|^2 \mathrm{SNR}_d)$, and the dash-dot curves account for the
channel capacity under antenna selection, i.e., $C(N, K, 0) =
\log(1 + \sum_{n=1}^{N} \delta_n \cdot |h_n|^2 \mathrm{SNR}_d)$. Several observations are in
order. First, both figures clearly show that the gain of deploying
more high-resolution ADCs decreases rapidly, and thus a small
number of high-resolution ADCs already achieves a
large fraction of $C(N, N, 0)$. For example, our proposed system
architecture with 10 pairs of high-resolution ADCs attains 84%
of $C(100, 100, 0)$ when $\mathrm{SNR}_d = 0$ dB, and it achieves 91% of
$C(100, 100, 0)$ when $K$ increases to 20. Besides, both figures
indicate that, compared to the antenna selection paradigm, one-bit
ADCs in our architecture are less beneficial when the SNR grows
large, but still significantly improve the system performance in
the low to intermediate SNR regime.
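
The two benchmark curves are closed-form and easy to reproduce for a given channel realization. A small Python sketch follows, with hypothetical function names, rates in nats, and the SNR converted from dB to a linear scale; the channel values are illustrative and not taken from the simulations.

```python
import math


def capacity_unquantized(h, snr):
    """C(N, N, 0) = log(1 + ||h||^2 * SNR)."""
    return math.log(1.0 + sum(abs(hn) ** 2 for hn in h) * snr)


def capacity_antenna_selection(h, delta, snr):
    """C(N, K, 0): only antennas with delta_n = 1 contribute."""
    gain = sum(d * abs(hn) ** 2 for hn, d in zip(h, delta))
    return math.log(1.0 + gain * snr)


def db_to_linear(snr_db):
    """Convert an SNR in dB to a linear power ratio."""
    return 10.0 ** (snr_db / 10.0)


# Hypothetical 3-antenna channel at SNR = 0 dB, one high-resolution pair.
h = [2 + 0j, 1j, 1 + 0j]
snr = db_to_linear(0.0)
c_full = capacity_unquantized(h, snr)
c_sel = capacity_antenna_selection(h, [1, 0, 0], snr)
```

The GMI of the mixed architecture is expected to lie between these two benchmarks, since the one-bit antennas add information beyond pure antenna selection.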

B. GMI for Multi-user Scenario 

Now, we examine the feasibility of our proposed system
architecture in the multi-user scenario. Figure 4 collects the
result, where the solid curves correspond to Scheme #1 suggested
by asymptotic analysis in the low SNR regime, the dash-dot
curves are obtained by random assignment per Scheme #2, and
the dashed lines refer to the per-user capacity with $N$ pairs of
high-resolution ADCs, i.e., $\frac{1}{M} \log\det(\mathbf{I} + \mathcal{E}_s \mathbf{H}\mathbf{H}^H)$.

Fig. 4. GMI of the considered user j in multi-user scenario, N = 100, M = 10.
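
The per-user capacity benchmark can be evaluated numerically from the channel matrix. The sketch below is a plain-Python illustration under the assumption that H is stored as M rows of N complex entries and that the benchmark is (1/M) log det(I + E_s H H^H) with unit noise variance. The log-determinant is computed by Gaussian elimination through a hypothetical helper, where a library routine would normally be used.

```python
import math


def gram(H):
    """Return the M x M matrix H H^H."""
    M = len(H)
    return [[sum(a * b.conjugate() for a, b in zip(H[i], H[k]))
             for k in range(M)] for i in range(M)]


def log_det_hpd(A):
    """log det of a Hermitian positive definite matrix via elimination."""
    n = len(A)
    U = [[complex(v) for v in row] for row in A]
    total = 0.0
    for col in range(n):
        pivot = U[col][col].real          # positive for a positive definite A
        total += math.log(pivot)
        for r in range(col + 1, n):
            factor = U[r][col] / U[col][col]
            for c in range(col, n):
                U[r][c] -= factor * U[col][c]
    return total


def per_user_capacity(H, Es):
    """(1 / M) * log det(I + Es * H H^H), in nats per user."""
    M = len(H)
    G = gram(H)
    A = [[Es * G[i][k] + (1.0 if i == k else 0.0) for k in range(M)]
         for i in range(M)]
    return log_det_hpd(A) / M


# Two users with orthogonal unit-norm channels: each user gets log(1 + Es).
rate = per_user_capacity([[1 + 0j, 0j], [0j, 1 + 0j]], Es=1.0)
```

With orthogonal user channels there is no inter-user interference, which is why the per-user rate reduces to the single-user expression in this toy case.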

We notice that although Scheme #1 is only analytically justified
for the low SNR case, it does achieve better performance than
Scheme #2. For the special case of $K = N$, it is well known
that the linear MMSE receiver is suboptimal for MIMO channels
[13], and thus we observe a distinguishable gap between the
per-user capacity and the GMI. Most importantly, here a small
number of high-resolution ADCs also attains a large fraction of
the channel capacity with $N$ pairs of high-resolution ADCs. For
example, when $\mathrm{SNR}_d = 0$ dB and $N = 100$, Scheme #1 with
$K = 10$ achieves 76% of the per-user capacity with $N$ pairs
of high-resolution ADCs, and this number rises to 80% when
$K = 20$.

Figure 5 accounts for the impact of an increasing number of
users on the achievable sum rates, focusing on $K = 20$ and
$\mathrm{SNR}_d = 0$ dB. For a fair comparison, we introduce another
performance curve corresponding to the MMSE receiver with $N$
pairs of high-resolution ADCs. We conclude that our proposed
architecture achieves satisfactory performance even when the
system is heavily loaded, by noticing that, even for $M = N = 100$,
our proposed architecture still attains 70% of the sum rate achieved
by the MMSE receiver without quantization.

VI. Conclusion 

In this paper, we propose a mixed-ADC receiver architecture, 
and leverage the GMI to analytically evaluate its achievable data 
rates under two scenarios. The corresponding numerical study 
concludes that the proposed architecture with a small number of 
high-resolution ADCs suffices to achieve a significant fraction 
of the channel capacity without output quantization, for both 
single-user and multi-user scenarios. Thus our approach provides 
a systematic way of designing energy-efficient massive MIMO 
receivers. 

A number of interesting and important problems remain
unsolved beyond this paper, such as designing the optimal ADC
assignment scheme for any SNR, especially for the multi-user
scenario, and extending the analysis to more comprehensive hardware
impairment models beyond ADCs. Besides,
in order to make this approach effective for wideband channels,
which are more prevalent in future communication systems,
it is particularly crucial to extend the analysis to frequency-selective
fading channels. We note that this is feasible but beyond
the scope of this paper, and will be treated in future work.

Fig. 5. Sum GMI for users in multi-user scenario: increasing number of users,
N = 100, K = 20, SNR_d = 0 dB.